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(54) Off-axis levelling In lithographic projection apparatus 



(57) In an off-axis levelling procedure a height map 
of the substrate is generated at a measurement station. 
The height map is referenced to a physical reference 
surface of the substrate table. The physical reference 
surface may be a surface In which is inset a transmis- 
sion image sensor. At the exposure station the height of 
the physical reference surface is measured and related 



to the focal plane of the projection lens. The height map 
can then be used to determine the optimum height 
and/or tilt of substrate table to position the exposure 
area on the substrate in best focus during exposure. 
The same principles can be applied to (reflective) 
masks. 
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Description 

(0001 ] The present invention relates to height detection and levelling, for example of the substrate and/or mask, in 
lithographic apparatus. More particularly, the invention relates to a system for off-axis levelling in lithographic projection 
s apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 
a first object table provided with a mask holder for holding a mask; 
a second, movable object table provided with a substrate holder for holding a sut>strate; 
10 a projection system for Imaging an inBdiated portion of the mask onto a target portion of the substrate; and 

a positioning system for moving said second object table between an exposure position, at which saki projection 
system can image said mask portkm onto said substrate, and a measurement position. 

[0002] For the sake of simplicity, the projectton system may hereinafter be referred to as the "lens"; however, this 
IS term should be broadly interpreted as encompassing various types of projection system, including refractive optics, 
reflective optics, catadioptric systems. arxJ charged particle optics, for example. The radiation system may also include 
elements operating according to any of these principles for directing, shaping or controlling the projection beam, and 
such elements may also be referred to below, collectivety or singularly, as a "lens". In addition, the first and second 
object tables may be referred to as the "mask table" and the "substrate table", respectively Further, ttie lithographic 
20 apparatus may be of a type having two or more mask tables and/or two or more substrate tables. In such "multiple 
stage" devices the additional tables may be used in parallel, or preparatory steps may be cam'ed out on one or more 
tables while one or more other tables are being used Ibr exposures. 

[0003] Lithographic projection apparatus can be used, for example, in tiie manufacture of integrated circuits (ICs). 
In such a case, the mask (reticle) may contain a circuit pattern con-esponding to an individual layer of the IC. and this 

25 pattern can be imaged onto an exposure area (die) on a substrate (silicon wafer) which has been coated with a layer of 
photosensitive material (resist). In general, a single wafer will contain a whole network of adjacent dies which are suc- 
cessively irradiated via the reticle, one at a time. In one type of lithographic projection apparatus, each die is irradiated 
by exposing tiie entire reticle pattern onto the die in one go; such an apparatus is commonly referred to as a wafer step- 
per. In an alternative apparatus — which is commonly referred to as a step-and-scan apparatus — each die is irradiated 

30 by progressively scanning tiie reticle pattern under the projection beam in a given reference direction (tiie "scanning" 
direction) while synchronously scanning the wafer table parallel or anti-parallel to this direction; since. In general, the 
projection system will have a magnification factor M (generally < 1). the speed V at which the wafer table is scanned will 
be a tactor M times that at which the reticle table is scanned. More information with regard to lithographic devices as 
here described can be gleaned from Intemational Patent Application WO 97/33205. for example. 

35 [0004] Until very recentiy. lithographic apparatus contained a single mask table and a single substBte table. How- 
ever, machines are now becoming available in which there are at least two independently moveable substrate tables; 
see, for example, tiie muKi-stage apparatus described in International Patent Applications W098/28665 and 
WO98/40791 . The basic operating principle behind such multi-stage apparatus is that, while a first substrate table is at 
the exposure position underneath the projection system for exposure of a first sut>strate located on that tak>le. a second 

40 substrate table can run to a loading position, discharge a previously exposed substrate, pick up a new substrate, per- 
form some initial measurements on the new substrate and then stand ready to transfer tiie new substrate to the expo- 
sure position underneath the projection system as soon as exposure of ttie first substiBte is completed; the cyde then 
repeats. In this manner it is possible to increase substantially the machine throughput, which in turn inrproves the cost 
of ownership of tiie machine. It should be understood that the same principle could be used with just one substrate table 

45 which is moved between exposure and measurement position. 

[0005] The measurements performed on the sul>strate at the measuremient position may, for example, include a 
determination of the spatial relationship (in X & Y directions) between various contemplated exposure areas on the sub- 
strate ("dies"), reference markers on tiie substrate and at least one reference marker (e.g. fiducial) located on the sub- 
sto^ate table outside the area of tiie substi^ate. Such information can subsequentiy be employed at the exposure position 

so to perform a fast and accurate X and Y positioning of the exposure areas witti respect to the projection k>eam; for more 
information see WO 99/32940 (P-0079). for example. This document also desaibes the preparation at the measure- 
ment position of a height map relating the Z position of the substrate surface at various points to a reference plane of 
the substrate holder. However the reference plane is defined by a Z-interferometer at the measurement position and a 
different Z-interferometer is used at the exposure position. It is therefore necessary to know accurately the relationship 

55 between the origins of the two Z-interferometers. 

[0006] An object of the present invention is to provide a system for off-axis levelling a substrate in a lithographic pro- 
jection apparatus tiiat avoids the need to relate tiie origins of two interferometer systems and enables additional 
improvements in positioning of the exposure areas during exposure processes. 
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[0007] Aocording to the present invention tliere is provided a litliograpliic projection apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 
a first object table provided with a mask holder for holding a mask; 

5 a second, movable object table provided with a substrate holder for holding a substrate; 

a projection system for imaging an irradiated portion of the mask onto a target portion of the substrate; and 
a positioning system for moving said second object table between an exposure station, at which said projection 
system can image said mask portion onto said substrate, and a measurement station; characterized in that 
said second object table has a physical reference surface fixed thereto; 

10 and by: 

height mapping means located at said measurement station and constructed and arranged to measure the height, 
relative to said physical reference surface* of a plurality of points on the surface of a substrate held on said sub- 
strate holder and to create a height map thereof; 

position measuring means located at said exposure station for measuring the position of said physical reference 
IS surface in a first direction substantially perpendicular to said substrate surface, after movement of said second 
object table to said exposure station; and 

control means constructed and an^anged to control the position of said second object table in at least said first 
direction, during exposure of said target portion, in accordance with said height map and said posKion measured 
by said position measuring means. 

20 

[0008] According to a further aspect of the present invention there is provided a method of manufiacturing a device 
using a lithographic projection apparatus comprising: 

a radiation system for supplying a projection t^m of radiation; 
2S a first object table provided with a mask holder for homing a mask; 

a second, movable object table provided with a substrate holder for holding a substrate; and 

a projection system for imaging irradiated portions of the mask onto target portions of the substrate at an exposure 

station; the method comprising tiie steps of: 

providing a mask besuring a pattern to said first object table; 
30 providing a substrate having a radiation-sensitive layer to said second object table; and 

imaging said inradiated portions of tiie mask onto said target portions of the substrate; characterized by the steps 

of: 

before said step of imaging, generating, with the second object tak>le at a measurement station, a height map indi- 
cating the height of a plurality of points on the substrate surface relative to a physical reference surface on said sec- 
35 ond object table; 

moving the second object table to said exposure station and measuring the position of saki physical reference sur- 
face in a first direction substantially perpendicular to sakl substrate surface; and 

during sakJ step of imaging, positioning the second object table in at least said first direction by reference to said 
height map and sakil measured position in said first direction of said physical reference surface. 

40 

[0009] In a manufacturing process using a lithographic projection apparatus according to the invention a pattern in 
a mask is imaged onto a substrate which is at least partially covered by a layer of energy-sensitive material (resist). 
Prior to this imaging step, the sut>strate may undergo various procedures, such as priming, resist coating and a soft 
bake. After exposure, the substrate may be subjected to other procedures, such as a post-exposure bake (PEB), devel- 

46 opment, a hard bake and measurement/inspection of the imaged features. This anray of procedures is used as a basis 
to pattern an individual layer of a device, e.g. an IC. Such a patterned layer may then undergo various processes such 
as etching, ion-implantation (doping), metallisation, oxklation. chemo-mechanical polishing, etc., all intended to finish 
off an individual layer. If several layers are required, then the whole procedure, or a variant thereof, wilt have to be 
repeated for each new layer. Eventually, an array of devices (dies) will be present on the substrate (wafer). These 

so devices are then separated from one another by a technique such as dicing or sawing, whence the individual devices 
can be mounted on a carrier, connected to pins. etc. Further information regarding such processes can be obtained, for 
example, from the book "Miaochip Fabrication: A Practical Guide to Semiconductor Processing". Third Edition, by Peter 
van Zant, McGraw Hill Publishing Co., 1997. ISBN 0-07-067250-4. 

[001 0] Atthough specific reference may be made in this text to ttie use of ttie apparatus according to tiie invention 
55 in the manufacture of ICs. it shouM be explidtiy understood that such an apparatus has many other possible applica- 
tions. For example, it may be employed in the manufacture of integrated optical systenfis, gudance and detection pat- 
terns for magnetic domain memories, fiquid-crystal display panels, tfiin-f ilm magnetic heads, etc. The skilled artisan will 
appreciate that, in the context of such alternative applications, any use of the terms "reticle", 'Nwafer" or "die" in this text 
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should be considered as being replaced by the more general terms "mask**, "substrate** and exposure area", respec- 
tively. 

[001 1 ] In the present document, the terms "radiation" and "beam" are used to encompass all types of electromag- 
netic radiation or particle flux, including, but not limited to. ultraviolet radiation (ag. at a wavelength of 365nm, 248nm, 

5 193nm. 157nm or 126nm), extreme ultraviolet radiation (EUV), X-rays, electrons and ions. Also herein, the invention is 
described using a reference system of orthogonal X. Y and Z directions and rotation about an axis parallel to the / direc- 
tion is denoted R/. Further, unless the context othenwise requires, the term "vertical" (Z) used herein is intended to refer 
to the direction normal to the substrate or mask surface, rather than implying any particular orientation of the apparatus. 
[001 2] The present invention will be described below with reference to exemplary embodiments and the accompa- 

10 nying schematic drawings, in which: 

Figure 1 depicts a lithographic projection apparatus according to a first embodiment of the invention; 

Rgure 2 is a view showing how the wafer height is determined from measurements by the level sensor and the Z- 

interferometer; 

IS Rgures 3 to 6 are views showing various steps of the off-axis levelling procedure according to the present inven- 
tion; 

Figure 7 is a plan view of a substrate table showing the sensors and f idudals used in the off^axis levelling procedure 
according to the present invention; 

Figure 8 is a side view of the exposure and measurement stations in a second emkxxJiment of the invention; 
20 Figure 9 is a flow diagram illustrating various steps of the measurement process carried out at the measurement 
station of the second embodiment of the invention; 

Rgure 10 is a flow diagram illustrating various steps of the exposure process carried out at the exposure station of 
the second embodiment of the present invention; 

Figure 1 1 is a diagram illustrating the scan pattern usable to measure the height map of the present invention; 
25 Rgure 12 is a diagram illustrating an aKernative scan pattern usable to measure the height map of the present 
Invention; 

Rgure 13 is a diagram illustrating the global level contour process in the second embodiment of the present inven- 
tion; 

Figure 14 and its sub-Figures A to G illustrate the structure and operation of a presently prefened embodiment of 
30 a level sensor usable in the invention; 

Rgure 15 is a graph showing detector output vs. substrate table position for a capture spot of the level sensor of 
Rgure 14; 

Rgure 1 5A is a diagram showing the arrangements of detector segments for the capture spot of the level sensor of 
Rgure 14; 

35 Rgures 1 6 and 1 7 are diagrams illustrating a presentiy preferred embodiment of a confidence sensor usable in the 
second embodiment of the invention; 

Rgure 18 is a diagram of a beam splitter usable in the confidence sensor of Figures 1 6 and 1 7; 

Figure 19 is a diagram used to explain a Z-interferometer calibration procedure usable in embodiments of the 

invention; 

40 Figure 20 is a diagram illustrating the notation used in describing an exposure trajectory optimization procedure 
according to a third embodiment of the invention; and 

Rgure 21 depicts a lithographic projection apparatus according to a fifth embodiment of the invention. 
[0013] In the drawings, like references indicate like parts. 

45 

Embodiment 1 

[001 4] Rgure 1 schematically depicts a lithographic projection apparatus according to the invention. The apparatus 
comprises: 

so 

• a radiation system LA Ex, IN, CO for supplying a projection beam PB of radiation (e.g. UV or EUV radiation); 

• a first object table (mask table) MX provided with a mask holder for holding a mask MA (e.g. a reticle), and con- 
nected to first positioning means for accurately positioning the mask with respect to item PL; 

• a second object table (substi-ate or wafer table) WTa provided with a substrate holder for holding a substrate W (e.g. 
S5 a resist-coated silicon wafer), and connected to second positioning means for accurately positioning the substrate 

with respect to item PL; 

a third object table (substrate or wafer table) WTb provided witii a substrate hokJer for holding a sutjstrate W (e.g. 
a resist-coated silicon wafer), and connected to third positioning means for accurately positioning the substrate witti 
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respect to item PL; 

a measurement system MS for performing measurement (chaFacterization) processes on a substrate held on a 
substrate table WTa or WTb at a measurement station: 
• a projection system ("lensT PL (e.g. a refractive or catadioptric system, a mirror group or an array of field deflec- 
5 tors) for imaging an irradiated portion of the mask MA onto an exposure area C (die) of a substrate W held in a suk>- 
strate table WTa or WTb at an exposure station. 

[0015] As here depicted, the apparatus is of a transmissive type (l-e. has a transmissive masl^. However. In gen- 
eral, it may also be of a reflective type, for example. 

10 [001 6] The radiation system comprises a source LA (e.g. a Hg lamp, examiner laser, an undulator provided around 
the path of an electron beam in a storage ring or synchrotron, a laser plasma source or an electron or ion beam source) 
which produces a beam of radiation. This beam Is passed along various optical components comprised in the illumina- 
tion system. — e.g. beam shaping optics Ex. an integrator IN and a condenser CO — so that the resultant beam PB 
has a desired shape and intensity distribution in its cross-section. 

IS [0017] The beam PB sut>sequently intercepts the mask MA which is held in a mask holder on a mask table MT. 
Having passed through the mask MA. the beam PB passes through the lens PL, which focuses the beam PB onto an 
exposure area C of the substrate W. With the aid of the interferometric displacement and measuring means IF. the sub- 
strate tables WTa, WTb can be moved accurately by the second and third positioning means, e.g. so as to position dif- 
ferent exposure areas C In the path of the beam PB. Similariy, the first positioning means can be used to accurately 

20 position the mask MA with respect to the path of tiie beam PB. e.g. after mechanical retrieval of the mask MA from a 
mask library. In general, movement of the object tables MT, WTa, WTb will be realized with the aid of a long stroke mod- 
ule (course positioning) and a short stroke module (fine positioning), which are not explicitly depicted in Rgure 1. In the 
case of a waferstepper (as opposed to a step-and-scan apparatus) the reticle tattle may be connected only to a short 
stroke positioning device, to make fine adjustments in mask orientation and position. 

25 [0018] The second and third positioning means may be constructed so as to be able to position their respective 
substrate tak)les WTa, WTb over a range encompassing both the exposure station under projection system PL and the 
measurement station under the measurement system MS. Alternatively, the second and third positioning means may 
be replaced by separate exposure station and measurement station positioning systems for positioning a substrate 
table in the respective exposure stations and a table exchange means for exchanging the substrate tables between tiie 

30 two positioning ^tems. Suitable positioning systems are described, inter alia, in WO 98/28665 and WO 98/40791 
mentioned above. It shouM be noted that a lithography apparatus may have multiple exposure stations and/or multiple 
measurement stations and that the numbers of measurement and exposure stations may be different than each other 
and tiie total number of stations need not equal the number of substrate tables. Indeed, tiie principle of separate expo- 
sure and measurement stations may be employed even with a single substrate table. 

35 [001 9] The depicted apparatus can be used in two different modes: 

1. In step-and-repeat (step) mode, tiie mask table MT Is kept essentially stationary, and an entire mask image is 
projected in one go (i.e. a single llash') onto an exposure area C. The substrate table WT Is ttien shifted in the X 
and/or Y directions so that a different exposure area C can be irradiated by the beam PB; 

40 2. In step-and-scan (scan) mode, essentially the same scenario applies, except that a given exposure area C is not 
exposed in a single Hash". Instead, the mask tat>le MT is movable in a given direction (tiie so-called "scan direc- 
tion", e.g. the Y direction) witii a speed v, so that the projection beam PB is caused to scan over a mask image: 
concurrentiy, the substrate table WTa or WTb is moved in the same or opposite direction at a speed V = Mv . in 
which M is tiie magnification of the lens PL (typically. M » 1M or 1/5). In this manner, a relatively large exposure 

45 area C can be exposed, without having to compromise on resolution. 

[0020] An important factor influencing the imaging quality of a lithographic apparatus is the accuracy with which the 
mask image is focused on the substrate. In practice, since the scope for adjusting tiie position of the focal plane of ttie 
projection system PL is limited and tiie depth of focus of that system is small, this means ttiat the exposure area of the 

50 wafer (sut3strate) must be positioned precisely in ttie focal plane of the projection system PL. To do ttiis. it is of course 
necessary to know both the position of the focal plane of tiie projection system PL and the position of the top surface 
of tiie wafer. Wafers are polished to a very high degree of flatness but nevertheless deviation of tiie wafer surface from 
perfect flatness (referred to as "urtflatness") of sufficient magnitude rxrticeably to affect focus accuracy can occur. 
Unf latness may be caused, for example, by variations in wafer thickness, distortion of tiie shape of the wafer or contam- 

55 inants on tiie wafer holder. The presence of structures due to previous process steps also significantiy affects the wafer 
height (flatness). In the present invention, the cause of unf latness is largely irrelevant; only the height of the top surface 
of the wafer Is considered. Unless the context otherwise requires, references below to "tiie wafer surface" refer to tiie 
top surface of the wafer onto which will be projected the mask image. 
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[0021 ] According to the invention, after loading a wafer onto the substrate table, the height of the wafer surface Zyy^. 
fer relative to a physical reference surfece of the substrate table is mapped. This process is carried out at the measure- 
ment station using a first sensor, referred to as the level sensor, which measures the vertical (Z) position of the physical 
reference surface and the vertical position of the wafer surface, Z|_s, at a plurality of points, and a second sensor, for 

5 example a Z-interferometer, which simultaneously measures the vertical position of tiie sutistrate table, Z|f at the same 
points. As shown in Figure 2, the wafer surface height is determined as Z yy^i^r = Z - Z ,p . The substrate table carry- 
ing the wafer is then transferred to the exposure station and the vertical position of the physical reference surface is 
again determined. The height map is then refen-ed to in positioning the wafer at the con-ect vertical position during the 
exposure process. This procedure is described in more detEul below with reference to Figures 3 to 6. 

10 [0022] As shown in Figure 3, first the substrate table is moved so that a physical reference surfece fixed to the sub- 
strate table is underneath the level sensor LS. The physical reference surface may be any convenient surface whose 
position in X. Y and Z on the substrate talkie will not change during processing of a wafer in the lithographic apparatus 
and. most importantly, in tiie transfer of the substrate table between measurement and exposure stations. The physical 
reference surface may be part of a fiducial containing other alignment markers and should have such properties as 

IS allow Its vertical position to be measured by the same sensor as measures the vertical position of the wafer surface. In 
a presentiy preferred embodiment the physical reference surface is a reflective surface in a fiducial in which is inset a 
so-called transmission image sensor (TIS). The TIS is described further below. 

[0023] The level sensor may be. for example, an optical sensor such as that described in US 5.191,200 (P-0039) 
(referred to therein as a focus error detection system); alternatively, a pneumatic or capacitive sensor (for example) is 
20 conceivable. A presently preferred form of sensor making use of Moir6 patterns formed between the image of a projec- 
tion grating reflected by tiie wafer surface and a fixed detection grating is described below in relation to a second 
embodiment of the invention. The level sensor may measure the vertical position of a plurality of positions simultane- 
ously and for each may measure the average height of a small area, so averaging out unf latnesses of high spatial fre- 
quencies. 

25 [0024] Simultaneously with the measurement of the vertical position of a physical reference surface by the level 
sensor LS. the vertical position of tiie substrate table is measured using the Z-interferometer. Z|p The Z-tnterferometer 
may, for example, be pan of a ttiree. five or six-axis interferometric metrology system such as that described in WO 
99/28790 (P-0077) or WO 99/32940 (P-0079). The Z-interferometer system preferably measures the vertical position 
of the suk)strate table at a point having the same position in the XY plane as the calibrated measurement position of the 

30 level sensor LS. This may be done by measuring tiie vertical position of two opposite sides of the substrate table WT at 
points in line with tiie measurement position of the level sensor and interpolating/modelling between them. This ensures 
that, in the event that the wafer table is tilted out of the XY plane, the Z-interferometer measurement oorrectiy indicates 
the vertical position of the substrate tat>le under the level sensor. 

[0025] Preferably, this process is repeated with at least a second physical reference surface spaced apart. e.g. 
35 diagonally, from the first physical reference surfece. Height measurements from two or more positions can then be used 
to define a reference plane. 

[0026] The simultaneous measurement of the vertical position of one or more physical reference surfaces and the 
vertical position of the substrate table establishes a point or points determining the reference plane relative to which the 
wafer height is to be mapped. A Z-interferometer of the type mentioned above is effectively a displacement sensor 

40 rather than an absolute sensor, and so requires zeroing, but provides a highly linear position measurement over a wide 
range. On the other hand, suitable level sensors, e.g. those mentioned above, may provide an absolute position meas- 
urement with respect to an externally defined reference plane (i.e. nominal zero) but over a smaller range. Where such 
sensors are used, it is convenient to move the substrate table vertically under the level sensor until the physical refer- 
ence surface(s) is (are) positioned at a nominal zero in the middle of the measurement range of the level sensor and to 

45 read out the current interferometer Z value. One or more of these measurements on physical reference surfaces will 
establish the reference plane for the height mapping. The Z-interferometer is then zeroed with reference to the refer- 
ence plane. In tfiis way the reference plane is related to the physical surface on the substrate table and the Zwafer height 
map is made independent of the initial zero position of the Z-interferometer at the measurement station and other tocal 
Actors such as any unflatness in tiie base plate over which the substrate table is moved. Additionally, the height map 

50 is made independent of any drift in the zero position of the level sensor. 

[0027] As illustrated in Figure 4. once the reference plane has been established, the substrate table is moved so 
that the wafer surface is scanned underneath the level sensor to make the height map. The vertical position of the wafer 
surface and the vertical position of the substrate table are measured at a plurality of points of known XY position and 
subtracted from each other to give the wafer height at the known XY positions. These wafer height values form the wafer 

55 height map which can be recorded in any suitable form. For example, the wafer height values and XY coordinates may 
be stored togetiier in so-called indivisit^le pairs. Alternatively, the points at which wafer height values are taken may be 
predetermined, e.g. by scanning the wafer along a predetermined path at a predetermined speed and making meas- 
urements at predetermined intervals, so that a simple list or array of height values (optionally together with a small 
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number of parameters defining the measurement pattern and/or a starting point) may suffice to define the height map. 
[0028] The motion of the substrate table during the height mapping scan is largely only in the XY plana However, 
if the level sensor LS is of a type which only gives a reliable zero reading, the substrate table is also moved vertically to 
keep the wafer surface at the zero position of the level sensor. The wafer height is then essentially derived from the Z 

5 movements of the substrate table, as measured by the Z-interferometer, necessary to maintain a zero readout from the 
level sensor. However, it is preferable to use a level sensor that has an appreciable measurement range over which its 
output is linearly related to wafer height, or can be linearized. Such measurement range ideally encompasses the max- 
imum expected, or permissible, variation in wafer height With such a sensor, the need for vertical movement of the sut>- 
strata table during the scan is reduced or eliminated and the scan can be completed faster, since the scan speed is then 

10 limited by the sensor response time rather than by the ability of the short stroke sut>strate table to track the contour of 
the wafer in three dimensions. Also, a sensor with an appreciable linear range can allow the heights at a plurality of posi- 
tions (e.g. an array of spots) to be measured simultaneously 

[0029] Next, the wafer table is moved to the exposure station and, as shown in Figure 5, the (physical) reference 
surface is positioned under the projection lens so as to allow a measurement of its vertical position relative to the focal 

IS plane of tiie projection lens. In a preferred embodiment, this is achieved using one or more transmission image sensors 
(described below) whose detector is physically connected to the reference surface used in the earlier measurements. 
The transmission image sensor(s) can determine the vertical focus position of the projected image from the mask under 
the projection lens. Armed with this measurement, the reference plane can be related to the focal plane of the projection 
lens and a path for the sut^strate table in tiiree-dimensions which keeps the wafer surface in optimum focus can be 

20 determined. One method by which this can be done is to calculate Z, Rx and Ry setpoints for a series of points along 
the scan path. The setpoints are determined using a least squares method so as to minimize the difference between 
the wafer nfiap data and the focus plane of the exposure slit image. For ease of calculation, the relative motion of the 
exposure slit image and wafer can be expressed as the slit moving relative to a static wafer. The least squares criterion 
can then be expressed as. fbr each time f. finding the values Z(t), Rx(t) and Ry(t) which give a minimum value of: 

25 

- ^ f/2 W/2 

= I lk*> rzr*;+ ^'Ry(t)- onRxftjjfdKdy w 

30 

where wfx.yj is the wafer height map arxl the exposure slit image is a rectangular plane of width s in the scanning direc- 
tion and lengtii W perpendicular to the scanning direction with its position defined by z(t), Rx(t) and Ry(t). The setpoints 
and tiie wafer trajectory can be expressed as functions of either Y (position in the scanning direction) or t (time) since 
these are related by >^ = y o+ ^ ^ where yo is the starting position and v is the scanning speed. 

35 [0030] As mentioned above, the physical reference surface(s) is (are) preferak)ly a surface in which a ti-ansmission 
image sensor (TIS) is inset. As shown in Rgure 7, two sensors TIS1 and TIS2 are mounted on a fiducial plate mounted 
to tiie top surface of the substrate table (WT, WTa or WTb). at diagonally opposite positions outside the area covered 
by the wafer W. The fiducial plate is made of a highly stable material witii a very low coefficient of tiiermal expansion. 
e.g. Invar, and has a flat reflective upper surface which may carry markers used in alignment processes. TIS1 and TIS2 

40 are sensors used to determine directly the vertical (and horizontal) position of the aerial image of the projectbn lens. 
They comprise apertures in the respective surface close behind which is placed a photodetector sensitive to the radia- 
tion used for the exposure process. To determine the position of the focal plane, the projection lens projects into space 
an image of a TIS pattern TIS-M provided on the mask MA and having contrasting light and dark regions. The substrate 
stage is then scanned horizontally (in one or preferably two directions) and vertically so that the aperture of the TIS 

45 passes tiirough the space where the aerial image is expected to be. As the TIS aperture passes through the tight and 
dark portions of the image of the TIS pattern, the output of the photodetector will fluctuate. The vertical level at which 
the rate of change of amplitude of the photodetector output is highest indicates the level at which tiie image of TIS pat- 
tern has tiie greatest contiast and hence Indicates the plane of optimum focus. An example of a TIS of this type Is 
desalbed in greater detail in US 4.540.277. Instead of the TIS. a Reflection Image Sensor (RIS) such as that described 

so in US 5. 1 44,363 may also be used. 

[0031] Using tiie surface of the TIS as tiie physical reference surface has the advantage that the TIS measurement 
directiy relates ttie reference plane used for the height map to the focal plane of the projection lens and so the height 
map can be employed directiy to give height corrections for the wafer stage during the exposure process. This is illus- 
trated in Figure 6. which shows tiie substrate table WT as positioned under the confrol of the Z-interferometer at a 

55 height determined by the height map so that the wafer surface is at tiie correct position under tiie projection lens PL. 
[0032] The TIS surface may additionally carry reference markers whose position is detected using a through-tiie- 
lens (TTL) alignment system to align the substrate table to the mask. Such an alignment system is described in EP- 
0.467.445 A (P-0032). fbr example. Alignment of indivkJual exposure areas can also be carried out. or may be obviated 
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by an alignment procedure carried out at the measurement stage to align the exposure areas to the reference markers 
on the wafer stage. Such a procedure is described in EP-0 306 590 A (P-0070) for example. 
[0033] H will be appreciated that the mask image projected by the projection system PL in a production process. In 
both step-and-repeat and step-and-scan modes, is not a single point but extends over a significant area in the XY plane. 

5 Since the wafer height may vary significantly over this area it is desirable to optimize the focus over the whole of the 
projection area, rather than just at a single point. In embodiments of the present invention, this can t>e achieved by con- 
trolling not only the vertical position of the substrate table WT. but also its tilt about the X and Y axes (Rx. Ry). With 
knowledge of the location and extent of the intended exposure areas, the height map generated by the present inven- 
tion can be used to calculate in advance optimum Z, Rx and Ry position setpoints for the sut)strate table for each expo- 

10 sure. This avoids the time required for levelling in known apparatus that only measure wafer height when the wafer Is 
positioned under the projection lens and hence increases throughput. The optimum Z. Rx and Ry setpoints may be cal- 
culated by various known mathematical techniques, Ibr example using an Iterative process to minimize defocus (defined 
as the distance between the wafer surface and tiie ideal focal plane). LSO (t), integrated over tiie exposure area. 
[0034] A further advantage is possible in the step-and-scan rrxxie. In tills mode, tiie projection lens projects an 

IS image of only part of the mask pattern onto a corresponding part of the exposure area The mask and substrate are 
tiien scanned in synchronism through the object and image focal planes of the projection system PL so that the entire 
mask pattern is imaged onto the whole exposure area. Alttiough in practice the projection lens is held stationary and 
the mask and substrate are moved, it is often convenient to consider this process in terms of an image slit moving over 
the wafer surface. With the height map determined In advance by the present invention, it Is possible to calculate a 

20 sequence of Z. Rx and Ry setpoints matched to the XY scan path (usually, scanning takes place in only one direction, 
e.g. Y). The sequence of setpoints can be optimized according to ^ditional aiteria. e.g. to minimize vertical accelera- 
tions or tilt motions that might reduce throughput or induce undesirable vibrations. Given a sequence of spaced-apart 
setpoints, a scanning trajectory for the exposure can be calculated using a polynomial or spline fitting procedure. 
[0035] Whilst the present invention aims to position tiie wafer at the optimum position in Z. Rx and Ry for a given 

25 exposure, the variations in wafer surface height over the exposure area may be such ttiat the wafer cannot be posi- 
tioned to give adequate focus over the entire area. Such so-called focus spots can result in an exposure failure. How- 
ever, with the present invention such failures can be predicted in advance and remedial action can be taken. For 
example, the wafer may be stripped and recoated without the detrimental effect of further processing a badly exposed 
wafer. Alternatively. If tiie predicted failure affects only one or a few devices on tiie wafer whilst ottiers will be acceptable. 

30 throughput may be enhanced by skipping exposures tiiat can be predicted in advance to result in defective devices. 
[0036] A further advantage of focus-spot detection can be derived from analysis of height maps taken. When large 
deviations from a glot>al wafer plane are present in a wafer height map. this could indicate focus spots due to substrate 
unflatness or process influences. Comparing wafer height maps from several wafers can indicate focus spots due to 
contamination or unflatness of the substrate table. When focus spots appear at identical or near-identical positions for 

35 different wafers, this is most likely caused by substrate holder contamination (so-called "chuck-spots"). From one wafer 
height map. one can also compare tiie height map (topology) from repeated exposure areas (dies). If large differences 
occur at certain dies with respect to an average height map. one can suspect fbcus spots due to eitiier wafer processing 
or the sut)strate table. Instead of comparing wafer height maps, the same comparisons can also be done on the derived 
exposure paths per die. or on the defocus parameters MA. MSD or Moving Focus explained below. When a certain die 

40 or wafer deviates much from an average exposure path or defocus parameters, focus spots can also be detected. 
[0037] All of the above mentioned analysis can be done before a wafer is exposed, and remedial action, such as 
wafer rejection (processing influences) or substrate holder cleaning (chuck spots), can be taken. With these methods, 
fbcus spots can be localised to the size of the measurement spot of the level sensor 10. This implies a much higher 
resolution than previous methods of fbcus spot detection. 

45 

Embodiment 2 

[0038] A second errixxJiment of the present invention is shown in Figure 8, which shows only the exposure and 
measurement stations and only components relevant to the discussbn below. The second embodiment utilizes the lev- 

50 elling principle of the present invention described above, togettier with certain refinements described below. 

[0039] At the exposure station, to the left of Figure 8, tiie projection lens PL is shown mounted to metrology frame 
MF and projecting an image of TIS marker TIS-M on mask MA onto the sensor TIS mounted to wafer table WT. The 
metrology frame is isolated from the tiansmission of vibrations from other parts of the apparatus and has mounted on 
it only passive components used for fine metrology and alignment sensing. The whole metrology frame may be made 

55 of a material witii a very low coefficient of thermal expansion, such as Invar, so that it provides a very stable platform 
for tiie most sensitive measuring devices of the apparatus. The components mounted on tiie metrology frame MF 
include mirrors 34 and 35 to which the measurement beams of the Z-interferometer Zip are directed by 45*>-min'ors 31 
mounted on tiie sides of the wafer table WT. To ensure tiiat the Z position of the subsbBte table can be measured 
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throughout its range of movement in X. the mirrors 34, 35 have a correspondingly large extent in the X direction. To 
ensure the Z position can be measured throughout the range off Y-movement. the minors 31 cover the whole length of 
the wafer table. Also mounted to the metrology frame MF are the beam generating and receiving parts 21a. 22a of a 
confidence sensor 20a described further below. 

5 [0040] At the measurement station(on the right in Rgure 8). the same metrology frame MF carries mirrors 33 and 
32 which serve the same function as the mirrors 34, 35 at the exposure station: again mirrors 32. 33 will have a large 
extent in the X direction to accommodate the required range of movement of the substrate table WT which is just as 
large as that at the exposure station. Level sensor 10. comprising beam generating part 1 1 and detection part 12. is 
also mounted on the metrology frame MR Additionally, a confidence sensor 20b. essentially the same as confidence 

10 sensor 20a at tiie exposure posftion. is provided. Other measurement devices, for example an alignment module, can 
also be provided. 

[0041 ] As discussed above, the use of the physical reference surface (again In tills embodiment tiiis is provided by 
the upper surface of the TIS) relates the wafer height map to tfie wafer stage and makes it independent of tiie zero posi- 
tions of tiie two Z-interferometers and certain local factors such as unflatness of tiie base plate (stone) BP over which 

IS the wafer tables move. However, since the wafer height map is generated using tiie Z-interferometer at tiie measure- 
ment station and the substrate table position is controlled at the exposure station using a different Z-interferometer pro- 
vided tiiere. any differences as a function of XY position between the two Z-interferometers can affect ttie accuracy wHh 
which the wafer surface is positioned in the focal plane. The principal cause of these variations in an interferometer sys- 
tem of the type used in the present invention is unflatness of the mirrors 32. 33. 34. 35. The 45" mirrors 31 €ure attached 

20 to the wafer table WT and travel with it as it swaps between exposure and measurement stations. Any unflatness of 
these minors therefore has largely the same effect on positioning at the exposure station as at the measurement sta- 
tion, and largely cancels out. However, the minors 32, 33. 34 and 35 mounted on the metrology frame MF stay with their 
respective interferometers and so any differences in the surface contours of the corresponding pairs 32. 34 and 33, 35 
can adversely affect the vertical positioning accuracy of the substrate tatiie WT 

25 [0042] The confidence sensors 20a and 20b are used at initial set-up of the apparatus, and periodically as required 
thereafter, to calibrate the differences between ttie Z-interferometers at tiie measurement and exposure stations. The 
confidence sensors are sensors capable of measuring the vertical position of tiie upper surface of the wafer at one or 
more points as the substi^ate table is scanned underneath it. Confidence sensors 20a sjnd 20b can be similar in design 
to level sensor 10 but need not be; since tiiey are used only at setup (and for infrequent recalibration) and with a refer- 

30 ence wafer rather than production wafers, the design criteria are less onerous and advantage can be taken of this to 
design a simpler sensor. Conversely, the existence of the projection lens PL at the exposure station will restrict ttie 
physical locations available for the confidence sensor at that station, and this also needs to be taken into account in 
design or selection of each confidence sensor. High accuracy is required of the confidence sensors since tiie calO^ration 
they are used for wilt affect the quality of every exposure. 

35 [0043] In the calibration process using the confidence sensor(s), a reference wafer is loaded onto the substrate 
table. The reference wafer is preferably a bare silicon wafer. There is no requirement for it to be any flatter than a normal 
bare Si wafer but its surface finish (in terms of reflectivity) is preferably optimized for the confidence sensors. In a pre- 
ferred embodiment of the invention the reference wafer is preferably polished to maximize its reflectivity and minimize 
unflatness. 

40 [0044] In ttie calibration procedure, a partial height map of the reference wafer (as usual referenced to the physical 
reference surface) is generated at the measurement station using the confidence sensor 20b instead of tiie level sensor 
10. This is done in tiie same manner as with the level sensor 10: the physical reference surface (TIS) is positioned at 
the zero point of the confidence sensor to zero the Z-interferometer, the wafer is then scanned under the confidence 
sensor, and the height map is generated from the difference between the confidence sensor and Z-irtterferometer read- 

45 ings. A height map is also generated at the exposure station using the confidence sensor 20a at the same points as the 
measurement station height map. For tiiis calibration, the height maps need not be a complete scan of the wafer; ttiey 
need only cover st-ips corresponding to the movement of the Z-interferometer beam over the min^ors 32-35. (The order 
in which the maps are created is not important, provided ttie wafer is stable on ttie substarate table whilst both are done.) 
[0045] Since the height maps represent the same wafer, any differences between them will be caused by differ- 

50 ences between the measurement systems used to create them. The two confidence sensors are static, so their effects 
on the height maps will not be position-dependent and can be eliminated by normalizing the two height maps and/or 
sutrtracting static offsets. Any remaining differences will be position-dependent, and tiie two height maps can be sub- 
tracted from one another to generate correction tables (mirror maps) that relates the exposure station Z-interferometer 
to the measurement station Z-interferometer. These correction tables can be attributed to the differences between ttie 

55 mirrors 33. 35 and 32. 34 attached to tiie metrology frame MF and can be applied to ttie wafer height maps generated 
in a production process, or used to correct one of the Z-interferometers used to generate tiie map or to position the sub- 
strate table during the exposure. DeperxJing on the precise construction of the Z-lnterferometers. particularly the 
metrology frame mirrors cmd substrate table minrors, the differences in Z position caused by the unf latnesses of the mir- 
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rors in each interferometer system may also be tilt dependent In one or more degrees of freedom (Rx. Ry, Rz). To elim- 
inate this tilt dependence it may be necessary to use the confidence sensors to create several height maps with the 
wafer stage at various different tilts, from which a number of different correction tables (mirror maps) can be derived, as 
necessary. 

5 [0046] Having described the principle of the off-axis levelling procedure, now will be described some further refine- 
ments to it that are employed in the second embodiment, as well as how it is integrated into the production process. 
Figures 9 and 10 are referred to and respectively show the steps carried out at the measurement station and at the 
exposure station. In a lithography apparatus using two wafer tables, one table will be going through the steps of Figure 
9 whilst a second simultaneously goes through the steps of Figure 10 before they are swapped. In the description 

10 below, the life" of a single wafer is followed from measurement station (Rgure 9) to exposure station (Figure 10) and 
back. 

[00471 Starting at step SI in Rgure 9. a wafer coated with a photosens'rtive resist is loaded on to the substrate table 
WT. (Note that this may generally take place at a loading station separate from the measurement station at which the 
substrate table Is out of range of the interferometer system IF.) The wafer table is moved into the capture range(s) of 

IS one or more position sensitive devices (PSDs) so that an initial coarse zeroing of the interferometric metrology system 
can be performed, step 82. After the initial coarse zeroing, the fine initialization/zeroing of the interferometric system 
follows in steps S3 and S4. These two steps contain the level sensor measurements (denoted "LS") on tiie (two or 
more) physical reference surfaces, which will define tfie reference plane (fixed to the wafer table) with respect to which 
the wafer height map is measured. Also, two alignment measurements (denoted "AA") are done on markers located on 

20 the same physical reference surfaces, so as to define the horizontal reference positions fixed to tiie wafer table. These 
measurements In S3 and 84 effectively zero the interferometric system in all degrees of freedom. 
[0048] The next step In the levelling procedure is step 85. referred to as the global level contour (QLC). In this step, 
which is described further below, a wafer capture and an initial scan of the wafer with the level sensor is made to deter- 
mine its overall height and tilt as well as its approximate height at the points where the subsequent detailed scan will 

2S move onto or off the wafer. This information enables the substrate table trajectory for ttie wafer height map scan to be 
defined. 

[0049] in step 86. a global alignment of the wafer is done. At least two alignment markers on the wafer are meas- 
ured (W1 and W2), meaning that their XY position is determined with respect to the reference markers on the TIS f Idu* 
clals. This determines to what extent the wafer is horizontally rotated (Rz) with respect to the scan direction (y). and Is 
30 done to be able to correct the wafer rotation such that the wafer height map scans are done parallel to tiie exposure 
area axis (I.e. "going straight over the exposure areas"). 

[0050] After that, tiie levelling procedure continues with measurements necessary for a process dependent connec- 
tion (PDC). A process deperxlent correction is necessary with some forms of level sensor, and will now be explained. 
[0051 ] The wafer height map must be taken each time a wafer is exposed. If a wafer has already been subjected to 

35 one or more process steps, the surface layer will no longer be pure polished silicon and there may also be structures or 
topology representing tiie features already created on the wafer. Different surface layers and structures can affect the 
level sensor readings and in particular can alter its linearity. If the level sensor is optical, these effects may, for example, 
be due to diffraction effects caused by the surface sti-ucture or by wavelength dependence In the surface reflectivity, and 
cannot always be predicted. To determine tiie requiired process dependent correction, an exposure area or die Is 

40 scanned under the level sensor with the substaate table WT set to several different vertical positions spanning the linear 
or linearized range of the level sensor 10. The wafer height, i.e. the physical distance t>etween the wafer surface and 
the reference plane, should not change with the vertical position of the substrate XabUe; it is obtained by subtracting the 
measurements of the level sensor and Z-interferometer: ^^afbh = ^is'^[f ■ Therefore if the determined value of 
ZwAFER does change with vertical position of the substrate table this indicates that either or both the level sensor or Z- 

45 interferometer are not linear or not equally scaled. The Z-interferometer is deemed to be linear since it looks at the mir- 
rors on the wafer table and metrology frame; and in fact is linear to a greater extent than the required accuracy for the 
wafer height map, at least once the correction determined by the use of the confidence sensor is applied. Therefore, 
any differences In the wafer height values are assumed to result from non-linearity or mis-scaling of the level sensor. 
They, and the knowledge of at which level sensor readings tiiey were observed, can be used to conect ttie output of the 

so level sensor. It has been found in a presentiy preferred embodiment of the level sensor that a simple gain correction Is 
sufficient, but a more complex correction may be required for otiier sensors. 

[0052] If the wafer to be processed has exposure areas on it that have been subjected to different processes, then 
a process-dependent correction is determined for each different type of exposure area on the wafer. Conversely, if a 
t^tch of wafers having exposure areas tiiat have undergone the same or similar processes are to be exposed, it may 
55 only be necessary to measure the process-dependent correction for each type of exposure area once per batch. That 
correction can then be applied each time tiiat type of exposure area Is height-mapped in tiie batch. 
[0053] In many IC fabs. the photosensitive resist Is applied to the wafer immediately before it is loaded Into the 
littiography apparatus. For tills, arxl other, reasons, the wafer may be at a different temperature tiian ttie substrate table 
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when it is loaded and clannped In place. When the wafer cools (or warms) to the same temperature as the substrate 
table* thermal stresses can be set up because the wafer is clanrped very rigidly using vacuum suction. These may result 
in undesirable distortion of the wafer. Thermal equilibrium is likely to have been reached by the time the steps S2 to S7 
have been completed. Therefore, at step S8. the vacuum damping the wafer to the substrate table is released, to allow 
5 tiie thermal stresses In the wafer to relax, and then reapplied. This relaxation may cause small shifts in tiie position 
and/or tilt of the wafer but these are acceptable since steps 32 to S4 are independent of the wafer and S5 and S6 are 
only coarse measurements. Any shift in the wafer position at this stage does not affect tiie process-dependent correc- 
tion since that is a calibration of tine level sensor rather than a measurement of the wafer. 

[0054] After the vacuum has been reapplied, and from here on it Is not released again until the exposure process 

10 is finished, the Z-map Is carried out at step S9. The scan required for the Z-map must measure the height of sufficient 
points to enable the wafer to be positioned during exposure at the desired accuracy. It is also important that the points 
measured cover the actual area where the wafer Is to be exposed; measurements taken over non-exposure areas, such 
as scribe lanes and so-called mouse bites, may give misleading results. Accordingly, tiie height mapping scan must be 
optimized to the specific pattern of exposure areas on tiie wafer at hand; tills is described further below. 

75 [0055] Once the Z-map is completed, the advance alignment measurements, step S10. are candied out before the 
substrate table is swapped, at step S1 1 , to the exposure position. In the advance alignment process, the positions of a 
number of alignment markers on the wafer relative to the reference markers F located on the TIS fiducial (physical ref- 
erence surface) fixed to the substrate table are accurately determined. This process is not particularly relevant to the 
present invention and so is not described further herein. 

20 [0056] In the swap procedure, the substrate table canTing the height-mapped wafer anrives at the exposure station, 
step SI 3 in Figure 10. A coarse position determination of the substrate table is made at step SI 4 and. if necessary, a 
new mask MA is loaded onto the mask table MT, step Si 5. The mask loading process may be carried out. or at least 
begun, simultaneously with the substrate table swap. Once a mask is in position and a coarse position determination, 
step S14. has been made, a first TIS scan Is earned out using sensor TlSl at step S16. The TIS scan measures the 

25 vertical and horizontal position of tiie substrate table at which the TIS is located in the aerial image focus of the projec- 
tion lens, as described atx)ve, yielding a focal plane reference. Sirtce the height map generated as step S9 in Figure 9 
is referenced to ttie physical surface in which tiie TIS is located, tiie vertical positions of the substrate tatrfe necessary 
to put the wafer surface In ttie focal plane for the different exposure areas are directly derived. A second TIS scan, step 
S17. is also carried out using sensor TIS2. yiekiing a second point for referencing a focal plane. 

30 [0057] Once the TIS scans have been completed and the focal plane determined, the exposure process SI 8 is car- 
ried out. optionally after any necessary system calibrations in step S19 (e.g. adjustments to con^ect for lens heating 
effects). The exposure process will generally involve the exposure of multiple exposure areas using one or more masks. 
Where multiple masks are used, after mask exchange S20. one TIS scan S17 can be repeated to update any focal 
plane changes. Between some or all exposures, the system calibration step S19 may also be repeated. After comple- 

35 tion of all exposures, the substrate table carrying the exposed wafer is swapped at step S13 for tiie substi-ate table car- 
rying the wafer that has meanwhile been undergoing steps SI to SI 0 of Figure 9. The substrate table carrying ttie 
exposed wafer is moved to the loading station and tiie exposed wafer taken out so that a fresh wafer can be loaded and 
the cycle can resume. 

[0058] To explain the wafer height mapping scan of step S9 of Rgure 9. Figure 1 1 shows an example of a pattem 

40 of exposure areas C of various shapes and sizes arranged on a wafer to make best use of ttie silicon area. The different 
exposure areas C are separated by scn*be lanes SL and generally-triangular unused areas, known as "mouse-bites'* are 
inevitably left between the rectangular exposure areas and the curved edge of the wafer. The scribe lanes are where 
the wafer will be cut once all production processes have been completed (so as to separate the different devices) and 
some cutting techniques may require that the scribe lanes in one direction all span the entire width of the wafer; in that 

45 case It is convenient to orient tiiese full wafer-widtfi scribe lanes parallel to the scanning direction (e.g. ttie Y direction) 
if the apparatus is to be used in step-arxj-scan mode. The scribe lanes and mouse bites may not be exposed, and so 
after the wafer has been subjected to a few process steps or depositions of layers they may have very different heights 
and surface properties than the exposure areas C. Accordingly it is Important to disregard any height measurements in 
these areas, wfiich are not going to be exposed. 

so [0059] A presently preferred embodiment of the level sensor uses a linear anay of. e.g. . nine optical spots arranged 
perpendicular to the scanning direction to measure ttie height at nine points (areas) simultaneously. (Note ttiat the Z- 
interferometer data can also be interpolated to provide corresponding Z-position data of the substi-ate table at an array 
of corresponding level sensor points.) The array of spots is of a size suffteient to cover the width of the widest exposure 
area tiiat can be exposed in the apparatus. 

55 [0060] The presently preferred scanning scheme is to scan ttie array of spots in a meander path 50 such that the 
center spot of ttie array passes atong the midline of each column of exposure areas; ttils midline corresponds to ttie 
midline of the Illuminated slit in ttie exposure process. TTie data thus generated can be directiy related to the exposure 
scan with a minimum of rearrangement or cateulation. This metiiod also eliminates part of the mirror unflatness effect. 
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since, at both measurement and exposure stations, scans are carried out with the Z-interferometer beam pointing at the 
same position on the mirrors 31 attached to the substrate table. If the column of dies is narrower than tiie array of spots 
of the level sensor, data obtained from the spots not lying wholly within the exposure area are ignored. In other embod- 
iments of the level sensor it may be possible to adjust the width of the array of spots to match the width of the exposure 
5 areas. 

[0061 ] If a wafer has some exposure areas whose center lines are offset in the direction perpendicular to the scan- 
ning direction from those of the remainder, a nxx:lified scanning scheme may be used to advantage. This situation is 
illustrated in Rgure 12 which shows one row of dies E whose center lines are offset from tiie remaining dies D. In such 
a case, tiie map can be created more quickly and witii fewer accelerations for the substrate table by scanning two 
10 meander patiis. One patii, referenced 52 in Rgure 12, covers one set of exposure areas D and the other, referenced 
53, covers the otiiers E. Of course, other arrangements of the exposure areas may require further modifications to tiie 
scanning scheme. 

[0062] Where the level sensor has a limited linear or linearized range, which is likely the case, the substrate table 
WT must be scanned underneatii it at a vertical position that brings the wafer surface into tiiat range. Once the wafer 

IS surface has been found it is a simple matter, by means of a closed feedback loop of the level sensor reading to tiie sub- 
sti-ate table positioning system, to adjust the vertical position of the substrate table WT to keep the wafer surface in the 
linear or linearized range but it is not so simple to find the wafer surface when the le^el sensor first moves onto an expo- 
sure area from outside tiie wafer. In a meander path there are several such in-points, referenced 51 and indicated by 
arrows on the meander path 50 in Rgure 1 1 . compounding the problem. 

20 [0063] To find the wafer surface at the in-points 51 it is possible to provide a capture spot in advance of the main 
level sensor spot array. The reflection of the capture spot on the wafer is then directed to a detector that has a wider 
capture range than is tiie case for the main spots. This, however, requires additional hardware: a capture spot on botii 
sides of the main spots (before/after) or a restriction to scanning in only one direction. An altemative, not necessarily 
requiring additional hardware, is to stop the substrate table close to each in-point. perform a wafer capture and measure 

25 the wafer surface in the linear or linearized range of tiie level sensor to approximate tiie wafer surface position at the in- 
point. This however slows down the measurement procedure signif icantiy, which may have undesirable consequences 
for throughput. 

[0064] In this embodiment of the invention, these prot)lems are avoided by performing a global level contour scan 
mentioned above (step S5 in Rgure 9) after the wafer surface is captured. The global level contour scan is explained 

30 further witti reference to Figure 1 3. 

[0065] For ttie gkJbsA level contour scan the substiBte table is first positioned so that a convenient point (preferably 
near tiie edge) within an exposure area C is underneath a single capture spot and the main spots of the level sensor 
(spot array). The wafer surface is found, e.g. by scanning the substrate table vertically until tine wafer surface is captured 
and comes within the linear or linearized range of the main spots, and then the subsfrate table is scanned so tiiat tiie 

3S central spot 41 traverses a path 60 around tiie inside of the perimeter of the total exposure area. The captijre procedure 
is described further below. Measurements of tiie wafer surface height are taken at defined positions around tiie scan. 
Where other spots of the array as well as the center spot fall over (exposure areas of) the wafer, the measurements from 
these spots, as well as the central one, can also be taken. However, measurements should not be taken from spots fall- 
ing outside the exposure areas. As illustiBted, tiie global \e^e\ contour patii 60 is a winding patii following the edges of 

40 the exposure areas quite dosely: however a smoother path may also be employed and. particularly when tiie wafer is 
well filled with exposure areas, a drcular course 61 may well suffice and be more convenient The glok>al level contour 
may also be arranged as a drcle passing over mouse bites, in which case measurements are not taken over the mouse 
bites, or tiie data of any measurements taken on mouse bites are disregarded in calculation of the global height and tilt 
of the wafer. 

45 [0066] The data gathered in the global level contour scan are used for two purposes. Firstiy data relating to the 
wafer height in the vicinity of the in-points 51 (see Rgure 1 1) of the height mapping scan to be carried out later are used 
to predict the wafer height at the in-points 51 so that the sut>sti'ate table can be brought to the correct height to get the 
wafer surfoce position in the linear or linearized level sensor range during the mapping scan. In most cases only a few 
data points are required for this and they need not be particularly dose to the in-points to allow a suffidentiy accurate 

so prediction of tiie wafer height to be determined by interpolation or extrapolation. It is also desirable to know the local Ry 
tilt at the in-points 51 for the height mapping scan, since the level sensor has an array of spots in the X-direction which 
(preferably) all need to be brought within their linear or linearized ranges. If the global level contour scan is parallel, or 
nearly parallel, to the Y direction in the vicinity of any in-point. the Ry tilt cannot be accurately determined using data 
obtained from only a single spot. Where a level sensor having an array of measurement spots spaced apart in the X 

55 direction, such as that described below, is used, data from multiple spots can be used to determine a local Ry tilt Of 
course, data from spots lying witiiin the exposure area are selected If part of the an-ay falls outside tiiat area. 
[0067] The second use of the global level contour data is to determine a global, or average, height and tilt (around 
2 axes) fbr the whole wafer. This is done by known mathematical techniques. e.g. a least-squares mettiod. to determine 
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a plane that most closely fits the wafer height data gathered. If the global tilt (sometimes referred to as the "wedge^ is 
greater than a predetermined amount, this may well indicate an incorrect loading procedure. In that case the wafer can 
be unloaded and reloaded for a retry and even rejected if it continues to fail. The global height and tilt information is 
used to focus an advance alignment sensor used in step Si 0 of Rgure 9 to accurately determine the spatial relationship 
5 of alignment markers on the wafer to reference markers on the substrate stage. The advance alignment sensor and 
process are described in greater detail in WO 98/39689 (P-0070). 

[0068] During a wafermap scan, the level sensor 10 provides continuous Z and Ry feedback signals to the sub- 
strate taJoAe to keep the level sensor 10 in its linear or linearized range. If this feedback loop stops (the level sensor 10 
doesn't supply conrect numbers) the table is controlled by following a path corresponding to the glok>al wafer wedge (a 

10 Z profile according to global Rx). 

[0069] A presently preferred embodiment of the level sensor 10 is illustrated in Figure 14 and will be explained 
betow additionally with reference to Rgures 14A to 14G. which show aspects of the operation of the sensor. 
[0070] Level sensor 10 comprises a beam generation branch 1 1 which directs a measurement beam b|_s onto ttie 
wafer W (or the physical reference plane when the vertical position of tiiat is being measured, or any reflecting surface) 

IS and a detection branch 1 2 which measures the position of the reflected beam, which is dependent on the vertical posi- 
tion of tiie wafer surface. 

[0071 ] In the beam generation brsuich, the measurement beam is generated by light source 111, which may be an 
array of light emitting or laser diodes, or generated elsewhere and passed to "illuminator" 111 by optical fibers. The 
beam emitted by light source 111 preferably contains a wide band of wavelengths, e.g. from akx3ut 600 to 1050 nm, so 

20 as to average out any wavelengtti dependence of interference effects from the wafer surface, particularly after some 
process steps have been completed. Illumination optics 112. which may include any suitable combination of lenses and 
mirrors, collect the light emitted by light source 1 1 1 and evenly illuminate projection grating 113. Projection grating 113 
is shown in greater detail in Figure 14A and consists of an elongate grating 1 13a. which may be divided to generate an 
array of separate/discrete spots, with grating lines parallel to its axis, and an additional aperture 1 13b which forms a 

25 capture spot ahead of tiie main detection spot array on the wafer. The period of tiie grating will be determined in part 
by the accuracy at which the wafer surface position is to be measured and may. for example be about 30^m. The pro- 
jection grating is positioned with a small rotation around its optical axis such that tiie grating lines projected on the wafer 
are not parallel to any wafer coordinate axis, thereby to avoid interference with structures on the wafer which are along 
the X or y direction. Projection lens 1 14 is a telecentric system which projects an image of the projection grating 113 

30 onto the wafer W. Projectton lens 114 preferably consists essentially or only of reflecting optical elements so as to min- 
imize or avoid chromatic abenration in the prcjected image; since the projection beam is broadband these cannot easily 
be eliminated or compensated for in a refractive optical system. Folding mirrors 1 15. 116 are used to bring the projec- 
tion beam b^s into and out of the projection lens 114 and permit a convenient arrangement of tiie components of the 
beam generation branch. 

35 [0072] The projection beam bts is incident on the wafer at a fairly large angle, a, to tiie normal, e.g. in the range of 
from 60' to 80*, and is reflected into the detection branch 12. As shown in Figure 14B. if tiie wafer surface WS shifts in 
position by a distance Ah to position WS', then the reflected beam will be shifted relative to the beam r. prior to the 
shift in ttie wafer surface, by a distance 2. Ah.sin(a) . Rgure 14B also shows tiie appearance of tiie image on the wafer 
surface: because of tiie large angle of inckience. the image is spread out perpendicular to the grating linea 

40 [0073] The reflected beam is collected by detection optics 121 and focused on detection grating 126. which is 
essentially a copy of projection grating 1 13 and is sub-divkied to correspond to the spot-array pattern. Detection optics 
121 are directiy complementary to projection optics 114 and will also consist essentially or only of reflective elements, 
to minimize chromatic aberration. Again folding mirrors 122, 123 may be used to enable a convenient arrangement of 
the components. Between detection optics 121 and the detection grating 126 are positioned a linear polarizer 124 to 

45 polarize the light at 45* and a birefringent crystal 125 which causes a shear perpendicular to tiie grating lines equal in 
magnitude to the grating period between the horizontal and vertical polarized components of the light. Figure 14C 
shows the beam as it would be at the detection grating 126 without the birefringent crystal: it is a series of alternating 
light and dark bands with tiie light bands polarized at 45°. The birefringent crystal 1 25 shifts tiie horizontal and vertical 
polarization states so ttiat the light bands of the horizontal polarization component fill the dark bands of the vertical 

so polarization component. As shown in Figure 14D. the illumination at the detection grating 126 Is therefore uniform grey 
but has stripes of alternating polarization state. Figure 14E shows the detection grating 126 overlaid on this pattern, 
which depends on tiie vertical position of tiie wafer surface; when the wafer is at a nominal zero vertical position, the 
detection grating 126 will overly and block half of tiie light bands of one polarization state. e.g. tiie vertical, and half of 
the otiier state. 

55 [0074] The light passed by the detection grating 1 26 is collected by modulation optics 1 27 and focused on detector 
128. Modulation optics include an polarization modulation device driven by an alternating signal. e.g. with a frequency 
of about 50kHz. so as to pass tiie two polarization states alternately. The image seen by the detector 128 therefore 
alternates between tiie two states shown in Rgure 14R Detector 128 is divided into a number of regions corresponding 



13 



EP1 037117 A2 

to the array of spots whose height is to be measured. The output of a region of detector 128 is shown in Rgure 14Q. tt 
is an alternating signal with period equal to that of the nnodulating optics and the amplitude of the oscillations indicates 
the degree of alignment of the reflected image of the projection grating on the detection grating, and hence the vertical 
position of the wafer surface. As mentioned above, if the wafer surface Is at the nominal zero position, the detection grat- 
ing 126 will block out half of the vertical polarization state and half of the horizontal polarization state so that the meas- 
ured intensities are equal and the amplitude of the oscillating signals output by the detector regions will be zero. As the 
vertical position of the wafer surface moves away from the zero position, the detection grating 126 will begin to pass 
more of the horizontally polarized bands and block more of the vertically polarized bands. The anplitude of the oscilla- 
tions will then increase. The amplitude of the oscillations, which is a measure of the vertical position of the wafer sur- 
face, is not directly linearly related to the vertical position of the wafer surface in nanometers. However, a correction 
table or formula can readily be determined on initial setup of the apparatus (and periodically recalibrated if necessary) 
by measuring the constant height of the sur^ce of a bcU-e silicon wafer at various different vertical positions of the sub- 
strate table, using the calibrated Z-interferometer and uncalibrated level sensor 10. 

[0075] lb ensure that the measurements of the level sensor and the Z-irrterferometer are taken simultaneously, a 
synchronization bus Is provided. The synchronization bus carries clock signals of a very stable frequency generated by 
a master clock of the apparatus. Both the level sensor and Z-interferometer are connected to the synchronization txjs 
and use the dock signals from the bus to determine sampling points of their detectors. 

[0076] The capture spot 1 1 3b passed by tiie projection grating 1 1 3 passes the detection grating, where it is incident 
on tiiree separate detection regions, two 131, 133 set high and one 132 set low, as shown in Rgure ISA. The output 
from the low detection region is subtracted from those of the high regions. The capture spot detector regions are 
arranged so that when the wafer surface is at the zero position, the capture spot falls equally on the high and low detec- 
tion regions and the subtracted output is zero. Away from the zero position, nx>re of the capture spot will fall on one of 
the detection regions than the other and the suk>tracted output will increase in magnitude with its sign indicating whether 
the wafer is too high or too low. The dependence of the subtracted detector output dcap on substrate table position Z|p 
is illustrated in Figure 15. This form of detector output allows a faster zero capture metiiod than a conventional servo 
feedback. According to the improved method, referred to as "move-untir, when the capture spot detector indicates that 
the wafer surface is too high or too low. the Z-position actuators of the substrate table are instructed to move the stage 
in the appropriate direction to bring the wafer surface into the linear or linearized range of the main level sensor array. 
The movement of the wafer stage continues until the output of the capture spot detector dcap passes a trigger level t^ 
or t| according to which direction it Is traveling. Crossing tiie trigger level causes the apparatus control to issue a com- 
mand to the Z-position actuators to begin a braking procedure. The trigger levels are set so that, in tiie response time 
and the time taken to brake the stage motion, the stage will move to. or close to, the zero position. Thereafter the stage 
can be brought to the zero position under control of the more accurate main level sensor spots. The trigger points wilt 
be determined In accordance with tiie dynamics of the stage and need not be symmetrically spaced about zero detector 
output. This "move-untir control strategy enables a rapkl and robust zero capture wittiout requiring a linear measure- 
ment system, and can be used in other situations. 

[0077] The level sensor described above can be further optimized to improve its performance. Improvement in 
accuracy in the scan (Y) direction can be effected by appropriate signal filtering and this may be adapted to specific 
process layers observed on partiy processed wafers. Additional improvements (for specific process layers) in all direc- 
tions may be obtained by optimization of the measurement spot geometry, which can be adjusted by changing the illu- 
mination optics 1 12 (to adjust the uniformity and/or angular distribution of tiie illumination li^^ on the projection grating 
1 13), by changing the projection grating 1 13 or by adjusting the detection system (size, position and/or angular resolu- 
tion of tfie detector and the number of detectors). 

[0078] A presentiy preferred form of the confidence sensors 20a, 20b is illustrated in Figures 1 6 and 1 7. The beam 
generation branch 21 comprises a light source 21 1 (e.g. a solid state laser diode or super-luminescent diode) which 
emits light of limited bandwidth. It is convenientiy situated away from the metrology frame and its output brought to the 
desired point by an optical fiber 212. The light is output from fiber terminator 213 and directed onto a beam splitter 215 
by collimating optics 214. Beam splitter 215 creates two parallel measurement beams bcsi and bcs2 which are focused 
to evenly illuminate respective spots 23 on the wafer W by telocentric projection optics 216. Since the measurement 
beams of the conf kience sensor have a limited bandwkfth, projection optics 21 6 can convenientiy employ refractive ele- 
ments. Detection optics 221 collect the reflected beams and focus them at tiie edge of detection prism 222 which is 
positioned between detectors 223. 224 and detection optics 221 . As shown in Rgure 1 7, which is a side view of detec- 
tion prism 222 and detector 223, a measurement beam is incident on tiie back of detection prism 222 and exits through 
angled faces 222a, 222b. Detector 223 consists of two detector elements 223a, 223b positioned so that light emerging 
from face 222a of detection prism 222 reaches detector element 223a and that emerging from face 222b reaches 
detector element 223b. Detector 224 is similar. Outputs of detector elements 223a and 223b are intensity-scaled and 
subtracted. When the wafer surface is at tiie zero position, the measurement beam fells symmetrically on faces 222a, 
222b of detection prism 222 and equal amounts of light will be directed to detector elements 223a and 223b. These will 
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then give equal outputs and so the subtracted output will be zero. As the wafer suiiace moves away from the zero posi- 
tion, the position of the reflected beam will move up or down and fall more on one of faces 222a. 222b than on the other 
resulting in more light being directed to the respective detector element so that the subtracted output will change pro- 
portionally. A tilt of the wafer can be determined by comparison of the outputs of detectors 223 and 224. 

5 [0079] This arrangement provides a simple and robust height and level detector that can be used as the confidence 
sensor in the second embodiment of the present invention as well as in other applications. The confidence sensor is 
primarily intended for initial set up and periodic, e.g. monthly, recalibration of the Z-interferometers of the measurement 
and exposure stations. However, the confidence sensor described above has a wider capture zone and more rapid 
response than the TIS used for precise determination of the position of the focal plane of the projection lens PL relative 

10 to substrate table WT Accordingly, the confidence sensor 20a can advantageously be used, when the substrate table 
is first swapped to the exposure station, to make a coarse determination of the vertical position of the TIS. The height 
measured by the confidence sensor is related to previously measured best focus position(8) and used to predict a start- 
ing point and range for the TIS scan near the expected position of the best focal plane. This means that the TIS scan, 
desaibed above, can t^e made shorter arxi hence quicker, improving throughput. 

IS [0080] A beam splitter 215 that can be used in the confidence sensors is shown in Rgure 18. A beam splitter is 
composed of a number of prisms from the same glass and preferably of equal thickness. The l>asic operation principle 
is described using a beam splitter cor^isting of 3 prisms 51, 52. 53. Prism 51 is trapezoidal in cross-section and the 
input beam 54 is incident normally on Its top face 55 near one side. The position of input beam 54 is such that it meets 
one side face 56 of first prism 51 which is at 45*" to the top face 55. Second prism 52 is joined onto side face 56 of first 

20 prism 51 and the join is coated so that a desired proportion of the input beam (half in the present embodiment) contin- 
ues into second prism 52 to form beam 57 whilst the remainder is reflected horizontally within first prism 51 to form 
beam 58. Beam 58 reflected in first prism 51 meets the second side face 59 of that prism, which is parallel to the first 
side face 56 and is r^lected downwards out of the lower face of first prism 51 and through top and bottom faces of third* 
prism 53 which are parallel to top face 55 of first prism 51 . Second side face 59 may be coated as necessary to ensure 

25 total internal reflection of beam 58. Beam 57, which passed into second prism 52. is reflected internally by two parallel 
faces of second prism 52, which are perpendicular to side face 56 of first prism 51 . and emerges from the kx)ttom face 
of second prism 52 which is parallel to the top face 55 of first prism 51 . Beams 57 and 58 are thereby output in parallel, 
but displaced. The separation between beams 57, 58 is determined by the sizes of prisms 51 and 52. Prism 53 is pro- 
vided to equalize the optical path lengths of beams 57. 58 so that the imaging optics for tx>th beams can be identical. 

30 Prism 53 also supports prism 52 as illustrated but this ms^ not be necessary in some applications. To enhance the 
reflection of beam 57 at tiie surface where prisms 52 and 53 meet, a void may be left or a suitable coating provided. 
[0081 ] Beam splitter 50 is simple, rotxjst and easy to construct. It provides output beams in parallel (whereas a con- 
ventional cubic beam splitter provides perpendicular beams) and with equal path length. The splitting suriace can be 
made polarization selective or not. and in the latter case can divide the input beam intensity evenly or unevenly as 

35 desired. 

[0082] It is a feature of the level and confidence sensors described above, as well as other optical height sensors, 
that they are insensitive to tilt of the wafer stage about an axis perpendicular to tiie Z-direction defined by the intersec- 
tion of the wafer surface WS and the focus plane of the measurement spot of the level sensor 10. This is due to the fact 
that the sensors measure a height over the area off the measurement spot extrapolated to tiie spot* s focus axis. The tilt 
40 insensrtivity can be used to calibrate the Z-interferometers and the optical sensors towards each other in tiie XY plane. 
The procedure for such calBaration is described with reference to Figure 1 9 and the level sensor, but a similar procedure 
can be used with the confidence sensor or any other similar optical sensor. 

[0083] The positioning system of the sut)strate table is linked to tiie multi-axis interferometer system of which tiie 
Z-interferometer is a part, and can be set to apply a rotation about a selected axis in the XY plane using spaced-apart 

45 Z-actuators. To align the Z-interferometer measurement position with the level sensor measurement spot, the position- 
ing system is used to rotate the stage about an axis passing through the Z-interferometer measurement position and 
parallel to. for example, the Y axis. The Z position of the table as measured by the Z-interferometer will remain 
unchanged during this tilt. If tiie level sensor and Z-interferometer are exactiy aligned, then tiie wafer surfece position 
will also remain unchanged. However, if the level sensor measurement position is offset from the Z-interferometer posi- 

50 Won by an amount 5X. as shown in Rgure 19. then tilting the substrate table WT to the position shown in phantom in 
that Figure will cause a change 6Wls in the level sensor output. The offset 6X, and the offset 5Y in the Y direction, can 
therefore be quickly determined by detecting any change in level sensor output with tilts atxsut two. preferably perpen- 
dicular, axes passing tiirough the Z-interferometer position. The parameters of the interferometer system or the level 
sensor 10 can tiien be adjusted to ensure that the Z-interferometer measurement position is exactiy opposite the level 

ss sensor measurement position. 

[0084] Where tiie level sensor uses an array of measurement spots, it cannot always be ensured that the spots are 
exactly aligned. The above technique can therefore be used to determine any offsets of the individued spots from the 
nominal position w.r.t the Z-interferometer position. This Information can then be used to correct the height map or the 
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level sensor output. 
Embodiment 3 

[0085] The third embodiment employs the levelling principle of the first embodiment and is the same as that en*od- 
iment except as described below. The third embodiment may also make use of the hardware and refinements of the 
second embodiment, described above. However, the third embodiment makes use of an improved method for optimiza- 
tion of the exposure path. This is explained below with reference to Figure 20. 

[0086] As discussed above, it is convenient and valkl to consider that the 8ut>strate stage is stationary and that the 
exposure slit Image moves, even though in practice it is the wafer that moves. The explanation below is given from this 
view point 

[0087] Figure 20 illustrates the notations used below. It should be noted that, although the slit image SI is depicted 
for clarity In Rgure 20 spaced from the wafer surface, the aim of the optimization procedure is to er^ure that during an 
exposure the focus plane of the slit image coincides as far as possible to the wafer surface. Considering a one dimen- 
sional wafer whose surface is defined by My) and a slit image SI, the moving average (over time) defbcus MAly) cor- 
responding to a coordinate on the wafer can be calculated from: 



MA(y)^ - llufyJ'fzfy-^vJ'uRxfy-^vjj^ (2) 

^ -s/2 

where the integral is taken over the slit size, s, in the scan direction and the integrand w(y)-[z(y+v)'V.Rx(y+v)] is the 
focus error on a point of the wafer at a certain moment in time. Similarly, the n[x>ving standard deviation for a point on 
the wafer can be defined as: 



MSD^(y)^- l[u(y)-[z(y'\-v)-'uRx(y+v)J''MA(yj\'€b (3) 

which is the defocus variation in time during the actual exposure of that point on the wafer. To minimize the difference 
between the plane of the exposure slit image and the wafer, a quadratic defbcus term is used, defined as follows: 



MF^(y)^ - \(ud[y)-[z(y^ vj- ziRx(y^ vJJJ'ch (4) 

where MF(y) is called the moving focus. It will be seen that MF(y) can also be written in terms of MA(y) and MSD(y) as 
follows: 



MF^(y)=MA Y y) + MSD^(y) (5) 



This means that in the optimization of the exposure path and minimisation of the moving focus over the exposure area, 
both the moving average and the moving standard deviation are taken into account, in contrast to the simpler least- 
squares optimization of the first embodiment, which neglects any time, and thus scanning, integration. Equations [3] 
and [4] can easily be extended to two dimensions by adding Ry(t) dependency and integrating MF over X from -W/2 to 
where W is the width of the slit in the X-direction. To calculate the optimization it is convenient to use a frequency 
domain representation. Calculation in the frequency domain also enables Ngh-frequency variations in the setpoints, 
that would result in excessive substrate stage accelerations in any or all of the degrees of freedom, to be filtered out. 
such that the exposure path is optimized for the performance of the substrate table positioning system. 



16 



EP1 037117 A2 



[0088] in the above discussion, the optimum focus of the exposure slit image is assumed to conform to a plane; 
however, this is not necessarily the case: the optimum focus may in fact lie on an arbitrary surface, resulting in a so- 
called focal plane deviation (FPD). If the contour of that surface over the exposure slit area can be measured using the 
TIS to create a focus map f (x,y). or calculated, then the resulting data or equations can be added to the equations above 
so that the wafer motion is optimized for the actual optimum focal surfeice. 

[0089] The optimization technique of the third embodiment can result in better focus for scanning systems and 
smootiier substrate stage trajectories, irtcreasing tiiroughput and yield. 

Embodiment 4 

[0090] In a fourth embodiment, the level sensor is provided with additional features to counteract errors in the 
measurement of tiie wafer surface position that may be caused by interference between the beam reflected by the top 
surface of the resist layer and the beam refracted into the resist layer and reflected by its bottom surface. Otiierwise, 
the fourtii embodiment may be tiie same as any of the first to third embodiments described above. 
[0091] The interference of beams reflected from said top and bottom surface is largely dependent on the resist 
properties and wafer surface properties, as well as on the optical wavelength and angle of incidence of the measure- 
ment beam. Broadband light sources and detectors are currentiy used to average out such single-wavelength interfer- 
ence effects. Improvement of this averaging principle can be realized if the wafer surface position is measured in a 
spectrally resolved manner, whereby a distinct measurement Is performed for a number of wavelengths in the broad- 
band measurement beam. To achieve this, it is necessary to make a temporally or spatially separated wavelengtii 
(color) system for measuring the wafer surface position. This necessitates changes such as the following to the level 
sensor's measurement prindpla 

[0092] A first possible change to the level sensor is to replace the continuous broadband light source by one capa- 
ble of selectively generating light beams of different wavelength ranges (colors). This can. for exanple, be achieved by 
selectively interposing different color filters (e.g. on a carousel) at a suitable point in the level sensor's illumination sys- 
tem, by the use of several independentiy selectable light sources, by using a wavelength-tunable light source, or by 
using a selected beam portion from a rotating/vibrating prism located in a small broadband beam. The level sensor is 
then used to take several nrteasurements of tiie wafer surface at each point, using difterent wavelengtiis of light in the 
measurement beam. 

[0093] Another option is to replace the broadband detector by one capable of selectively detecting light of different 
wavelength ranges (colors). This can be achieved, for example, by placement of color filters in the detection optics 
before the detector, by spatially splitting tiie measurement beam for different wavelengths using a prism and then 
detecting the different- wavelength beams on separate detectors, or by any otiier way of spectrally analyzing the broad- 
band-reflected beam to measure tiie wafer surface position. 

[0094] Naturally, it is also possible to use a combined approach, whereby lx>tii the projection system and the detec- 
tion system are adapted to achieve spectral resolution. 

[0095] In the absence of interference effects, each measurement (for each wavelength) should give the same 
result; consequentiy. if different results are obtained in such measurements, tills indfoates tiie presence of effects as 
refen^ed to in the first paragraph above. An improved wafer surface position measurement can then be derived using a 
variety of techniques. For example, discrepant results may be con-ected or discarded. Majority voting techniques may 
also be used. Alternatively, on the basis of a spectral measurement of the wafer surface position, one might even derive 
real positions by means of a model describing the spectral response of the resist and the wafer surface properties. 
[0096] Since the described interference effect also depends on the angle of incidence of tiie measurement beam 
on the wafer surface, one might also want to vary this angle of incidence so as to evaluate the effect and then correct 
it. Accordingly, a further possible change to the level sensor is to adapt it such that the wafer surface position can be 
made using measurement beams at different angles of incidence. One way to achieve this is to define multiple meas- 
urement beams having different angles of incidence for the same spot on tiie wafer, but separate projection and detec- 
tion optics systems. Alternatively, one can change the optical system so that the same projection arxi detection systems 
encompass the different optical axes pertaining to the various measurement beams. Another option, which generates 
temporally varying angles of incidence. Is to use rotating/translating folding mirrors (or otiier movable components) in 
the optical systems of the level sensor. 

[0097] As with the wavelength dependence described at>ove. in tiie absence of interference effects, measurements 
at different angles of incidence should give the same result. Therefore, any discrepancies (variation with angle of ind- 
dence) can be avoided, compensated for, or modeled in the same way. 

[0098] The ak>ove-mentioned additional features and improvements may. of course, be used togetiier or separately, 
and in otiier optical sensors than tiiose described here. 
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Embodiin&it 5 

[0099] A fifth entbcxiiment of the invention is shown in Figure 21 . The fifth embodiment of the invention is a lithog* 
raphy apparatus employing, as the exposure radiation, extreme ultraviolet (EUV) radiation, e.g. of wavelength in tiie 
range of 9 to 16nm. and a reflective mask MA*. Functionally at least, the components of the fiftii embodiment are gen- 
erally the same as those of the first embodiment but they are adapted to the exposure radiation wavelength used and 
their arrangement is adjusted to accommodate the beam path necessitated by the use of a reflective mask. Particular 
adaptions that may be necessary include optimizing the illumination and projection optics IL\ PL* to the wavelength of 
the exposure radiation; ttiis will generally involve the use of r^lective rather than refractive optical elements. An exam- 
ple of an illumination optical system IL' for use with EUV radiation is described in European Patent Application 
00300784.6 (P-0129). 

[01 00] An Important difference between litiiography apparatus using reflective masks and those using transmissive 
masks, is tiiat with the reflective mask, unf latness of the mask results in position errors on the wafer tiiat are multiplied 
by tiie optical path lengtin of the downstaream optical system, i.e. tiie projection lens PL'. This is because height and/or 
tilt deviations of the mask locally change tiie effective angle of Incidence of the iilunftination beam on the mask and 
hence change the XY position of tiie image features on the wafer. 

[0101 ] According to the fifth anbodiment of the invention, the effects of unf latness of the mask are avoided or alle- 
viated by making a height map of the mask in advance of the exposure and controlling the mask position in at least one 
of Z. Rx and Ry during tiie exposure. The height map can be generated in a similar manner to that described above (i.e. 
off-axis levelling of the mask at a measurement station): however, it may also be generated with the mask at the expo- 
sure station, which may obviate tiie need to relate the height map to a physical reference suriace. The calculation of the 
optimum position(s} of tiie mask during tiie exposure or exposure scan (tiie exposure path) can be equivalent to that 
desaibed above, but it may also be a coupled optimization of wafer and mask exposure paths. However, for a masK it 
may be advantageous to place greater weight in the optimization calculations on tilt deviations, since these will have a 
greater effect on the position at the wafer. 

[0102] It should be expllcitiy noted tiiat a lithographic projection apparatus according to the current invention may 
contain two (or more) substrate tables and/or two (or more) mask tables. In such a scenario, it is possible for a first sut^ 
strata on a first substi^ate table to be undergoing height-mapping at tiie measurem^ station while a second substrate 
on a second substrate table is concurrently undergoing exposure at tiie exposure station; and similarly in the case of 
multiple mask tables. Such a construction can greatty increase throughput. 

[01 03] It should also be expUdtty noted tiiat the current invention can be applied to substrate leveling alone, to mask 
leveling alone, or to a combination of substrate leveling and mask leveling. 

[0104] Whilst we have described atx>ve specific emtxxJiments of the invention it will be appreciated tiiat tiie inven- 
tion may be practiced othenwise than as described. The description is not intended to limit the invention. 

Claims 

1 . A lithographic projection apparatus comprising: 

a radiatkin system for supplying a projection beam of radiation; 

a first object table provided with a mask holder for hoMing a mask; 

a second, movable object tat>te provided with a substrate holder for holding a substrate; 

a projection system for imaging an irradiated portion of tiie mask onto a target portion of the substrate; and 

a positioning system for moving said second object table between an exposure station, at which said projection 

system can image said mask portion onto said substrate, and a measurement station; characterized in that 

said second object table has a physical reference surface fixed thereto; 

and by: 

height mapping means located at said measurement station and constructed and arranged to measure tiie 
height, relative to sakJ physical reference surface, of a plurality of points on the surface of a substrate held on 
said substrate holder and to create a height map thereof; 

position measuring means located at said exposure station for measuring the position of said physical refer- 
ence surface in a first direction sut>stantially perpendicular to said sut)strate surface, after movement of said 
second object table to said exposure station; and 

control means constructed and anBnged to control the position of said second object table in at least said first 
direction, during exposure of said target portion, in accordance witti said height map and said position meas- 
ured by said position measuring means. 

2. Apparatus according to claim 1 wherein said control means further arranged to oorrtrol the tilt of sakJ second 
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object table about at least one axis perpendicular to said first direction In accordance with said height map. 

3. Apparatus according to claim 1 or 2 wherein said height mapping means comprises a level sensor constructed and 
arranged to simultaneously measure the position in said first direction of a linear array of points. 

4. Apparatus according to daim 1 , 2 or 3. wherein said height mapping means comprises a level sensor constructed 
and arranged to measure the position of a measurement beam reflected by the surface whose position in said first 
direction is to be measured. 

5. Apparatus according to claim 4 wherein said level sensor comprises: a projection grating; projection optics for pro- 
jecting an image of said projection grating onto the surface whose position in said first direction is to be measured: 
a detection grating, detection optics for focusing light reflected by said surfece to form on said detection grating an 
Image of said projection grating; and a detector for detecting Moir6 patterns formed by the overlay of said image of 
said projection grating on said detection grating. 

6. Apparatus according to claim 5 wherein sard level sensor further comprises a radiation source constructed and 
arranged to illuminate said projection grating with polychromatic radiation and wherein said projection optics and 
said detection optics consist essentially of reflective optical elements. 

7. Apparatus according to any one of the preceding claims wherein said height mapping means comprises a level 
sensor for detecting the position in said first direction of tiie surface of said substrate at said plurality of points and 
position detection means for detecting the position In said first direction of said second object table simultaneously 
with measurements by said level sensor. 

8. Apparatus according to claim 7 wherein said position detection means comprises an interferometer. 

9. Apparatus according to any one of the preceding claims wherein said position measuring means comprises an 
image sensor mounted to said second object table and said physical reference surface comprises an upper surface 
of said image sensor. 

1 0. Apparatus according to any one of the preceding claim, wherein said position measuring means Is constructed and 
an^anged to measure the position of said physical reference surface relative to tiie focal plane of said projection sys- 
tem. 

1 1 . Apparatus according to any one of tiie preceding claims wherein said second object table has a plurality of spaced- 
apart physical reference surfaces; and said height mapping means is constructed and arranged to measure the 
height of said plurality of points relative to a reference plane defined by said plurality of physical reference surfaces. 

12. Apparatus according to any one of tiie preceding claims furtiier comprising: 

a second height mapping means located at said exposure station constructed and anranged to measure the 
height, relative to said physical reference surface, of said plurality of points on the surface of a substrate held 
on said substrate holder and to create a height map thereof; and 

calibration means constructed and arranged to compare height maps of a single substrate prepared by each 
of said first and second height mapping means to derive a relative calibration for separate position detection 
systems provided at said measurement and exposure stations. 

13. A method of manufacturing a device using a lithographic projection apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 
a first object table provided with a mask holder for holding a mask; 
a second, movable object tattle provided with a 8uk>strate holder for holding a substrate; and 
a projection system for imaging irradiated portions of the mask onto target portions of the substrate at an expo- 
sure station; the method comprising the steps of: 
providing a mask bearing a pattern to said first object table: 

providing a substrate having a radiation-sensitive layer to said second object table; and 

imaging said irradiated portions of the mask onto saM target portions of the suk>strate; characterized by the 

steps of: 
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before said step of imaging, generating, with the second object table at a measurement station, a height map 
indicating the height of a plurality of points on the substrate surfiace relative to a physical reference surface on 
said second object table; 

moving the second object tat)le to said exposure station and measuring the position of said physical reference 
surface in a first direction substantially perperxjicular to said substrate surface; and 

during said step of imaging, positioning the second object table in at least said first direction by reference to 
said height map and said measured position in said first direction of said physical reference surface. 

1 4. A method according to claim 1 3 wherein, during said step of imaging, said second object table is oriented about at 
least one axis perpendicular to first direction by reference to said height map. 

15. A method according to daim 13 or 14 wherein said second object table is positioned during said imaging step so 
as to minimize the squared defbcus integrated over the area of said target portion, wherein the defbcus comprises 
the distance in sakl first direction between tiie focal surface of said projection lens and the surface of said substrate. 

16. A method according to daim 13 or 14 wherein said step of imaging comprises scan imaging a slit image onto said 
substrate, arvi said second object table is positioned during said imaging step so as to minimize the squared defo- 
cus integrated over the duration of said scanning exposure and the area of said slit image, wherein the defbcus 
comprises the distance in said first direction between the focal surface of said projection lens and the surface of 
said substrate. 

17. A method according to any one of claims 13 to 16 wherein said step of generating a height map comprises tiie sub- 
steps of: 

measuring tiie position in said first direction of each of said plurality of points on said substrate surface; 
simultaneously with each measurement of the position of a point on said substrate surface, measuring the 
position in said first direction of said second object table; and 

subtracting each measured position of said second object table from the corresponding measured position of 
said substrate surface to generate said height map. 

18. A method according to daim 17 wherein said step of generating a height map comprises tiie initial step of meas- 
uring the position in said first direction of said physical reference surface and simultaneously tiie position in said 
first direction of said second object tat)le. 

19. A method according to any one of daims 13 to 18 comprising tiie further steps, before said step of generating a 
height rrmp, of: 

measuring the height of a plurality of points on said wafer surface adjacent the perimeter of areas on said sub- 
strate tiiat are to be exposed, and determining from the measured heights an overall height and tilt for said sub- 
strate and/or local height or tilt values in certain regions of said substrate surface whose height is to be 
mapped. 

20. A method according to any one of claims 13 to 19 further comprising the step, before said step of generating a 
height map. of calibrating a level sensor to be used in generating said height map by using said level sensor to 
make a plurality of measurements of the vertical position of at least one predetermined point on said substrate sur- 
face with the second object table being positioned at different vertical positions for different ones of said plurality of 
measurements. 

21. A method according to daim 20 wherein said step of calibrating is performed for a plurality of different exposure 
areas on said substirate and respective resulting calibration con'ections are applied in generating the height map for 
exposure areas conresponding in type to tiiose for which ttie calibration was performed. 

22. A device manufactured according to the mettiod of any one of daims 13 to 21 . 

23. A method of calibrating a lithographic projedion apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 
a first objed table provided witti a mask holder for holding a mask; 
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a second, movable object table provided with a substrate holder for holding a substrate; and 

a measurement station having a first position detection system for measuring the position of said second object 

table at said measurement station; 

an exposure station having a projection system for imaging irradiated portrons of the mask onto target portions 
of the sut>strate and a second position detection system for measuring the position of said second object table 
at said exposure station; the method comprising the steps of: 
providing a substrate to said second ok>ject table; 

at said measurement station, generating a first height map of said substrate by measuring the position in a first 
direction, substantially perpendicular to the surface of said substrate, of a plurality of points on said sut>strate 
surfcice and simultaneously measuring the position of said second object table using said first position detec- 
tion system; 

at said exposure station, generating a second height map of said substi^ate by measuring the position in said 
first direction of said plurality of points on said substrate surface and simultaneously measuring the posKion of 
said second object table using said second position detection system; and 

comparing said first and second height maps to calibrate said first and second position detection systems. 

24. A metiiod of manufecturing devices using a lithographic projection apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 
a first object table provided with a mask holder for holding a mask; 

a second, movable object table provided with a substrate holder for holding a substrate; and 

a projection system for imaging inadiated portions of the mask onto target portions of the substrate; the 

method comprising tiie steps of: 

providing a mask bearing a pattern to said first object table; 

providing a substrate having a radiation-sensitive layer to said second object table; and 
imaging said irradiated portions of the mask onto said target portions of the substrate; 
said steps of providing a substrate and imaging being repeated to expose a plurality of substi-ates; character- 
ized by tiie steps of: 

generating, for each substrate provided to second object table, a height map indicating tiie height of a plurality 
of points on the subsb^ate surface; and comparing the height maps of successively provided substrates to 
detect correlations in the locations of any unflatnesses that may be indicative of contamination or systematic 
feults of said second object table. 

25. A lithographic projection apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 

a first. movat)le object table provided with a mask holder for holding a reflective mask; 

a second, object table provided witii a substi-ate holder for holding a substrate; and 

a projection system for imaging an irradiated portion of the mask onto a target portion of tiie substrate; char- 
acterized by 

height mapping means constructed and arranged to measure the height, relative to a reference surface, of a 
plurality of points on tiie plane of a reflective mask held on said mask holder and to create a height map 
thereof; and 

control means consti-ucted and arranged to control the position of said first object table in at least said first 
direction, during exposure of said target portion, in accordance with said height map. 

26. A method of manufacturing a device using a lithographic projection apparatus comprising: 

a radiation system for supplying a projection beam of radiation; 

a first, movable object table provided with a mask holder for holding a reflective mask; 

a second object table provided with a substrate holder for holding a substrate; and 

a projection system for imaging irradiated portions of tiie mask onto target portions of ttie substrate; the 
method comprising tiie steps of: 

providing a reflective mask bearing a pattern to said first object table; 

providing a substrate having a radiation-sensitive layer to said second object table; and 

imaging said irradiated portions of tiie mask onto said target portions of the substrate; characterized by the 

steps of: 

t>etbre said step of imaging, generating, a height map indicating the height of a plurality of points on tiie mask 
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surface relative to a reference plane on said first object table; and 

during said step of Imaging, positioning the first object table In at least said first direction by reference to said 
height map. 
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